Picture for Bin Zhu

Bin Zhu

RoboTrustBench: Benchmarking the Trustworthiness of Video World Models for Robotic Manipulation

Add code
Jun 01, 2026
Viaarxiv icon

Beyond Task Success: Behavioral and Representational Diagnostics for WAM and VLA

Add code
May 31, 2026
Viaarxiv icon

A Finite-Calibration Regime Map for LLM Judge Panels

Add code
May 31, 2026
Viaarxiv icon

OSP-Next: Efficient High-Quality Video Generation with Sparse Sequence Parallelism, HiF8 Quantization, and Reinforcement Learning

Add code
May 27, 2026
Viaarxiv icon

Spatiotemporal Sycophancy: Negation-Based Gaslighting in Video Large Language Models

Add code
Apr 20, 2026
Viaarxiv icon

Learning ECG Image Representations via Dual Physiological-Aware Alignments

Add code
Apr 02, 2026
Viaarxiv icon

Marco DeepResearch: Unlocking Efficient Deep Research Agents via Verification-Centric Design

Add code
Mar 30, 2026
Viaarxiv icon

Teacher-Student Diffusion Model for Text-Driven 3D Hand Motion Generation

Add code
Mar 25, 2026
Viaarxiv icon

OSCBench: Benchmarking Object State Change in Text-to-Video Generation

Add code
Mar 12, 2026
Viaarxiv icon

Verified Multi-Agent Orchestration: A Plan-Execute-Verify-Replan Framework for Complex Query Resolution

Add code
Mar 12, 2026
Viaarxiv icon